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Abstract 

The complex interactions involved in regulation of a cell's function are captured by its interaction graph. 
More often than not, detailed knowledge about enhancing or suppressive regulatory influences and coop- 
erative effects is lacking and merely the presence or absence of directed interactions is known. Here we 
investigate to which extent such reduced information allows to forecast the effect of a knock-out or a com- 
bination of knock-outs. Specifically we ask in how far the lethality of eliminating nodes may be predicted 
by their network centrality, such as degree and betweenness, without knowing the function of the system. 
The function is taken as the ability to reproduce a fixed point under a discrete Boolean dynamics. We 
investigate two types of stochastically generated networks: fully random networks and structures grown 
with a mechanism of node duplication and subsequent divergence of interactions. On all networks we find 
that the out-degree is a good predictor of the lethality of a single node knock-out. For knock-outs of node 
pairs, the fraction of successors shared between the two knocked-out nodes (out-overlap) is a good predictor 
of synthetic lethality. Out-degree and out-overlap are locally defined and computationally simple centrality 
measures that provide a predictive power close to the optimal predictor. 
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1. Introduction 

The survival, functioning and growth of a living 
cell is based on a large set of interdependent bio- 
chemical interactions. Interaction networks [1' have 
proven to be useful summary pictures of such a bio- 
chemical system or part of it, especially when inter- 
actions are known qualitatively while precise quan- 
titative information is scarce. For many systems, 
the interaction network suffices to capture essen- 
tial features of dynamical behaviour [2 such as the 
presence of a stable stationary state, multistability, 
oscillations etc. Often such predictions do not even 
depend on the whole network structure. Qualita- 
tive statements on system behaviour may be based 
on the centrality of nodes [3J Uj or the presence of 
certain small subnetworks, called motifs [S]. 

Here we ask to what extent a limited knowledge 
of biochemical interactions is usable for predicting 
the reaction of a system to failure of one or several 
of its components [HI 13 [H]. This kind of theory 
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serves to complement experiments with knock-outs 
in vivo or in vitro [5] . A knock-out (or knock-down) 
is performed by blocking (or reducing) production 
of a single protein. Depending on the viability of 
the cell after suffering the modification, knock-outs 
are subject to a binary classification into lethal and 
viable. 

When knocking out several nodes (proteins) of a 
system simultaneously, a richer classification arises 
from considering the lethality of this combined 
knock-out together with the effect of each single 
knock-out. Synthetic lethality |10| is the class of 
lethal simultaneous knock-out of two nodes i and 
j, where independent knock-out of node i alone is 
viable and independent knock-out of node j alone 
is viable. Synthetic lethality has direct implications 
for target identification in anticancer drug discov- 
ery |TT] . Since the experimental effort of a complete 
scan of double knock-outs is quadratic in the num- 
ber of proteins, an accurate computational predic- 
tion of candidate pairs can greatly reduce the cost 
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of experiments. 

Here we study prediction of lethality and syn- 
thetic lethality in stochastically generated interac- 
tion networks. Knowledge is taken to be incomplete 
in the sense that only the absence or presence of 
interactions but not the type (enhancer / suppres- 
sor) is available. As predictors, we test efficiently 
computable network centrality measures based on 
degree and betweenness. Quality of predictors in 
terms of ROC curves (see Section [6]) is held against 
the optimal prediction possible with the available 
knowledge. Additionally, we use evolutionary dis- 
tance between nodes as a predictor. Thereby we 
find out how much the knowledge of paralogs sup- 
ports the identification of synthetically lethal pairs. 

Our notion of viability and lethality is based on 
a functional phenotype that we define here as a sta- 
tionary state of the unperturbed dynamical system. 
Regulatory interactions are mimicked by Boolean 
threshold dynamics that serves as a suitable testbed 
for the studies of robustness of networked biological 
systems jlj and for evolutionary studies [T5] . 

2. Network construction 

Throughout this contribution, a network on n 
nodes is given by an n x n matrix W. Each matrix 
entry Wij takes a value in {—1, 0, -1-1} where 



^1, if J is an enhancer of i 
Wii — ■{ —1, if j is a suppressor of i 
0, otherwise 



(1) 



Reduced information about interactions is repre- 
sent by assigning the network W a directed graph 
that we identify with its adjacency matrix A. The 



entries of A are given by aj 



Thus A con- 



tains the information about the absence or presence 
but not the type of a directed interaction. The den- 
sity p{W) of a network 



piW) 



1=1 3 = 1 



Wi 



(2) 



measures the fraction of interactions established out 
of the possible ones. We generate networks with 
the following two stochastic procedures. 

2.1. Random networks 

A random network W is generated by indepen- 
dently assigning each entry Wij a value -1-1 with 
probability p/2, a value —1 with probability p/2 
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Figure 1 : Example of network generation of a network with 
duplication and divergence, (a) The network at time-step 
t = is initialized with two asymmetrically coupled nodes. 
Afterwards it grows by successive duplication and divergence 
steps, (b) The growth process in terms of the network ma- 
trix W . (c) In the evolutionary tree, each leaf represents 
an extant node in the network. Inner nodes are common 
ancestors. 



and a value with probability 1 — p. The model 
parameter p is to be chosen from [0, 1] and deter- 
mines the average density of the random network 

IHfiSI. 



2.2. Networks from duplication and divergence 

An alternative statistical ensemble of networks is 
generated by duplication and divergence. This is 
motivated by the observation that an evolutionary 
extension of the repertoire of regulatory sequences 
is obtained by dupfication [T7 1 [T5 1 [T^ [20] . 

For generating a network by duplication and di- 
vergence (DaD) we start with a 2 x 2 matrix repre- 
senting two mutually but unequally coupled nodes, 
i.e. Wi2 — 4-1, W21 — ~1 and Wn — W22 — 0. 
Then the following process of duplication (i) and 
divergence (ii) is iterated. 

(i) A node i of the network with n—1 nodes is ran- 
domly drawn from a flat probability distribu- 
tion. Node i is duplicated, generating an addi- 
tional row and column with index n in the ma- 



trix W. The new entries are 



for all 1 < J < n, and w„ 



; and 

Wi... 



(ii) For each index pair (fc, I) with k = n or I ~ n: 
if \wki\ — 1, we set Wki ■— with probability r 
and leave Wki unchanged with probability 1 — r. 
Otherwise {w^i = 0), we set w^.i '.= +1 with 
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Figure 2: Average densities for networks generated with du- 
plication and divergence, (a) Average density with fixed 
removal probabilities r. (b) Average density with fixed addi- 
tion probabilities a. All points are averages over lO'^ network 
realizations. 



probability a/2, w^i := —1 with probability 
a/2 and leave Wki unchanged with probability 
1 - a. 

Step (i) implements gene duplication, in which 
both the original and the replicated proteins retain 
the same structural properties and the same set of 
interactions. The divergence steps (ii) implements 
the possible mutations of the replicated gene, which 
translate into the addition and removal of interac- 
tions with probabilities a and r. An example of 
this process is shown in Figure [T] Special attention 
is given to the handling of loops. If the randomly 
chosen original node has a loop, the loop is copied 
as well as two additional links with the same edge 
weight between the original and the replica node. 

The average density of networks generated with 
duplication and divergence is shown in Figure [2] as 
a function of parameters a and r. 

3. Knock-outs, dynamics, and functionality 

3.1. Knock-outs 

In a real biochemical interaction network, knock- 
ing out a node means that the concentration of the 
reactant represented by the node is set zero. For 



our purposes, it is equivalent to remove all outgo- 
ing interactions (arcs) of the node from the network. 
The knock-out of node k in network W leaves the 
network as ly^^*^} with matrix entries w}j-''^ — Wij 
if J ^ fc and otherwise. As a generalization, sev- 
eral nodes forming a set K C {1,2, ... ,n} may be 
knocked out. The resulting network W^^ has en- 
tries 

^ ( w,j lijiK 
'J 1 otherwise. 



(3) 



3.2. Dynamics 

In the present work we model gene regulatory 
networks by threshold dynamics [21]. This is a 
special case of Boolean dynamics [221 US] • A mul- 
titude of formulations for Boolean threshold net- 
works exist, depending how the thresholds are dis- 
tributed, how the behaviour at the threshold is de- 
fined and which weights for the edges are allowed. 
We choose the version as applied in the simplified 
yeast cell cycle network [53] and many succeeding 
studies [HHillinillZ], compare Equation [I] 

A node is activated, Si — 1, if the sum of its 
weighted inputs exceeds a threshold value (here: 
threshold is zero) . It is inactivated if the input sum 
falls below the threshold. In the case when the sum 
gives exactly the threshold value, the node value 
remains unchanged. Thus the Boolean state si of 
node i at time step t evaluates to 



lif Ef=i«..s, W>o 



oif Er=i«.j5. W <o 



(4) 



3.3. Functionality and lethality 

A state s* is a fixed point if it remains unaltered 
by the dynamics, i.e. the successor state of s* is 
s* itself. We define the function (in the sense of 
a phenotype) of a network W to be a fixed point 
state s* 7^ (0, . . . , 0). After generation of a network, 
a fixed point s* is found as described in Section [6| 

Given a network W and its functional fixed point 
s* , we say that a knock-out K C {1, . . . , n} is viable 
(for W, s*) if s* is a fixed point of W^^ . Otherwise 
K is lethal. We say that K is synthetically lethal, if 

(i) K is lethal and 

(ii) K' is viable for all proper subsets K' C K. 

Analogously one may define synthetic viability. A 
knock-out K is synthetically viable, if 
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Figure 3: Probability of lethal single node knock-outs as a 
function of network density p. All values are averages over 
10® realizations at the given value of p. Holding the addition 
(removal) probability a (r) constant limits the interval of 
possible densities. 

(i) K is viable and 

(ii) K' is lethal for all proper subsets K' C 
K' 

Thus synthetic lethality and synthetic viability are 
defined for arbitrary cardinality \K\ > 2 of knock- 
outs. In this contribution, however, only single and 
double knock-outs are considered. 

4. Results 

^.1. Statistics of lethal knock-outs 

We start by presenting the effect of single knock- 
outs in the two models of networks (random and 
DaD). Figure [3] shows, as a function of the density 
of the network, the probability that a single knock- 
out is lethal. For random networks, this probability 
increases with the arc density p. For all choices of 
parameters investigated here, DaD networks have 
fewer lethal knock-outs than random networks at 
the same density. Under constant probability a of 
adding interactions, the DaD networks even exhibit 
a decreasing number of lethal knock-outs with in- 
creasing density. 

Note that the probability of knock-outs being 
lethal cannot exceed 1/2 because on average half 
of the nodes are in the off-state on the functional 
fixed point. Knock-out of a node in the off-state in 
the network does not affect the states of the other 
nodes. This theoretical maximum, however, is not 
reached. Random networks at the largest possible 
density 1 have a probability of ~ 0.41 for a knock- 
out to be lethal. 
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Figure 4: Lethality of knock-outs as a function of network 
density p. Open symbols refer to lethal double knock-outs 
while solid symbols mean that the double knock-out is still 
viable. The shape of the symbols distinguishes between re- 
sults of single-node knock-outs: neither single knock-out (cir- 
cle), only one knock-out (square) or both single knock-outs 
are lethal (triangle). All values are averages over 10*^ real- 
izations at the given value of p. 



Table 1: Overview of the area under the ROC curves for prediction of single node knock-outs. Each, pair of rows is for networks 
with a given expected density p. The first row of each pair refers to random networks with parameter value p. The second row 
of each pair is for DaD networks with parameter values a and r. Higher values of the area refer to a higher accuracy of the 
prediction. 

struct, lethality out-deg. out + in-deg. out — in-deg. betw.centr. in-deg. 



p = 


0.14 


0.672 


0.649 


0.621 


0.591 


0.566 


0.483 


a = 


0.05, r = 0.58 


0.732 


0.708 


0.634 


0.662 


0.593 


0.486 


P = 


0.23 


0.623 


0.612 


0.587 


0.572 


0.560 


0.492 


a = 


0.1, r = 0.58 


0.681 


0.666 


0.608 


0.626 


0.577 


0.494 


P = 


0.26 


0.612 


0.602 


0.579 


0.566 


0.557 


0.494 


a = 


0.05, r = 0.25 


0.689 


0.682 


0.635 


0.617 


0.598 


0.534 


P 


0.35 


0.588 


0.581 


0.562 


0.553 


0.548 


0.496 


a = 


0.1, r = 0.25 


0.648 


0.642 


0.602 


0.596 


0.574 


0.518 



Table 2: Overview of the area under the ROC curves for prediction of double node knock-outs which exhibit synthetic lethality. 
Each pair of rows is for networks with a given expected density p. The first row of each pair refers to random networks with 
parameter value p. The second row of each pair is for DaD networks with parameter values a and r. Prediction based on 
evolutionary distance is only applicable for networks generated with duplication and divergence. Results which incorporate 
prior knowledge of the result of single node knock-outs are shown in brackets. Higher values of the area refer to a higher 
accuracy of the prediction. 









struct, syn. let. 


out-overlap 


repl. centr. 


evol. distance 


in-overlap 


p = 

a = 


0.14 
0.05, r = 


= 0.58 


0.888 (0.895) 
0.915 (0.922) 


0.859 (0.865) 
0.896 (0.903) 


0.597 (0.600) 
0.594 (0.597) 


0.601 (0.601) 


0.500 (0.499) 
0.531 (0.530) 


P = 
a = 


0.23 
0.1, r = 


0.58 


0.778 (0.787) 
0.857 (0.867) 


0.742 (0.752) 
0.832 (0.841) 


0.582 (0.586) 
0.588 (0.591) 


0.550 (0.551) 


0.501 (0.500) 
0.519 (0.519) 


P = 
a = 


0.26 
0.05, r -- 


= 0.25 


0.743 (0.752) 
0.799 (0.812) 


0.705 (0.717) 
0.779 (0.789) 


0.576 (0.581) 
0.605 (0.609) 


0.613 (0.611) 


0.500 (0.500) 
0.573 (0.572) 


P = 
a = 


0.35 
0.1, r = 


0.25 


0.673 (0.681) 
0.735 (0.748) 


0.632 (0.646) 
0.707 (0.724) 


0.559 (0.563) 
0.583 (0.587) 


0.570 (0.568) 


0.500 (0.500) 
0.546 (0.546) 
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Figure 5: ROC curves for prediction of lethality of single 
node knock-outs. Average network density is p = p for the 
random networks (panels in left column). In the correspond- 
ing panel in the right column, results for DaD networks with 
the same density are shown. Predictors are structural lethal- 
ity (solid curve, optimal predictor), out-degree (dotted), and 
betweenness centrality (dashed). The solid diagonal is the 
line of no discrimination. Each curve is based on 10^ network 
realizations. The synthetic lethality is estimated by extract- 
ing the graph from each network and probing another 10^ 
network realizations with the same graph structure. 
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Figure 6: ROC curves for prediction of synthetic lethality. 
Average network density is p = p for the random networks 
(panels in left column). In the corresponding panel in the 
right column, results for DaD networks with the same density 
are shown. Predictors are structural synthetic lethality (solid 
curve, optimal predictor), out-overlap (dotted), replacement 
centrality (short-dashed), and evolutionary distance (long- 
dashed). The solid diagonal is the line of no discrimination. 
Each curve is based on 10* network realizations. The struc- 
tural synthetic lethality is estimated by extracting the graph 
from each network and probing another 10"^ network realiza- 
tions with the same graph structure. 
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Now we turn to the statistics for double knock- 
outs {v, w} in combination with the single knock- 
outs {v} and {w}. In each of the panels of Fig- 
ure|4j the open circles connected by solid curves give 
the fraction of synthetically lethal pairs of nodes in 
networks of a given density p. Synthetic lethality 
becomes more abundant with increasing density in 
random networks (top panel) and in DaD networks 
with fixed arc removal probability (two lower pan- 
els). 

Synthetic lethality is just one possible outcome 
of knock-out tests performed on a pair {v,w}. Of 
the single-node knock-outs {v} and {w}, none, ex- 
actly one or both may be lethal. Combination of 
this ternary result with the binary outcome (lethal/ 
viable) of the two-node knock-out {v, w} yields six 
possible scenarios. The statistics of these scenar- 
ios is plotted in Figure |4j Interestingly, synthetic 
viability (dotted line with triangles) becomes more 
abundant than synthetic lethality in dense random 
networks. 

4-2. Prediction 

Now we investigate the predictability of the 
lethality, first of single, then of double knock-outs. 
Predictability is strongly dependent on the avail- 
able knowledge. If the network W and the func- 
tional fixed point s* arc fully known, perfect predic- 
tion of lethality is possible simply by computing the 
effect of the knock-out. In a more realistic scenario, 
only partial knowledge is available which we model 
here as follows. The presence or absence of each in- 
teraction is available with absolute accuracy while 
the type (enhancer/ suppressor) of each present in- 
teraction is totally unknown; information on the 
functional fixed point is not available. With the 
formalism described in section [2j for a network W 
only the adjacency matrix A of the directed graph 
is known. Then the best predictor of lethality of 
a knock-out {i} is the relative frequency li of {i} 
being lethal in networks W' that have adjacency 
matrix A and interaction types randomly assigned. 
In other words, {i} is predicted as lethal in W, if 
{i} is typically lethal in networks with the same ad- 
jacency matrix as W. We call k structural lethality. 

Computation of li may be impossible or imprac- 
tical in real scenarios. Therefore the value of a cen- 
trality measure at node i is extracted from A and 
used for prediction instead of i. This incurs an- 
other step of knowledge reduction. Here it is salient 
to choose the "right" centrality measure for predic- 
tion. We consider the betweenness centrality 6^, 



the out-degree d™* the in-degree dj", furthermore 
the degree sum d™* -I- dj" and the degree difference 
d°"' — d™ . These are defined in Section [g] as well 
as other quantities used here. The predictive power 
of the different measures is summarized in Table [T] 
for random and DaD models with varied parameter 
values. The deviation of a value from 0.5 indicates 
that the quantity contains information about the 
lethality of nodes in the given scenario. This is the 
case for all the centrality measures under consid- 
eration except for the in-degree. In DaD networks 
the chance to predict the effect of a knock-out is 
larger than in random networks of the same den- 
sity, e.g. a = 0.1, r = 0.25 leads to networks with 
density p « 0.35 which show a larger predictability 
than random networks with the same density. Best 
predictions are based on the out-degree whose pre- 
dictive power almost reaches the best possible ob- 
tained by the structural lethality. Predictive power 
is measured as the area under the ROC curve of 
sensitivity versus specificity of the prediction which 
are shown in Figure [5] 

Now we study the prediction of synthetically 
lethal pairs. The framework is mostly analogous 
to that of single knock-outs. Again we assume that 
the adjacency matrix but not the full network (with 
interaction types) are known. Eligible predictors 
are now measures of joint centrality, i.e. mapping a 
given unordered node pair {i,j} in a given graph A 
to a number. Here we investigate the out-overlap 
0°"*, the in-overlap oj" and the replacement cen- 
trality rij as defined in section [6[ Furthermore, 
the evolutionary distance Cij is used as a predictor 
in networks evolved by duplication and divergence 
(DaD). Tabled summarizes the predictive power 
of these centrality measures, again in comparison 
with that of the optimal predictor Sij here called 
structural synthetic lethality. As the main result, 
the out-overlap o°"* is the best predictor of syn- 
thetic lethality in all cases considered, its predic- 
tive power is close to optimal in all cases consid- 
ered. Prediction of synthetic lethality is facilitated 
in DaD networks as compared to random networks. 

An interesting alternative scenario arises under 
the assumption that we already know all viable 
single node knock-outs, y(Vl^) = {i : {i} viable} 
in each network W considered. For the prediction 
of synthetically lethal pairs, candidates are subsets 
{hj} ^ ^(W^), i 7^ j- The predictive power for 
this scenario with restricted candidate set is given 
in brackets in Table [5] Prior knowledge of viable 
single-node knock-outs does not induce a significant 
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increase of predictive power for any of the combi- 
nations of predictor, network generation model and 
density. 

For a more detailed view of sensitivity and speci- 
ficity, Figure [6] shows the ROC curves of selected 
predictors in the case without prior knowledge of 
viable single knock-outs. 



5. Concluding remarks 



6.2. Evolutionary distance 
Along with the generation of a network W with 



the DaD model (Section 2.2), the evolutionary tree 
T is constructed, cf. the example in Figure [TJ The 
nodes {1, 2, . . . , n} of the network W are the leaves 
of the tree T. The evolutionary distance Cij is de- 
fined as the length of the path between between 
leaves i and j on T. Each edge on T is counted 
with unit length. 



The present contribution has established a the- 
oretical framework for assessing predictability of 
knock-out effects in networked regulatory systems. 
We covered a broad range of scenarios in terms of 
network structures and predictors. Nevertheless it 
must be emphasized that outcomes depend on the 
choice of specific definitions made. 

First of all, the definition of lethality is made 
in the context of the regulatory network, assuming 
that the only task of a node is regulation within 
the system considered. If the disabled protein is 
involved otherwise, e.g. as a structural protein, its 
potentially lethal knock-out cannot be predicted in 
the present framework. 

Secondly, the definition of functionality of the 
network as the presence of a fixed point is not the 
only reasonable choice. For instance, we may de- 
mand that the fixed point s* be stable in the sense 
that the system returns to s* after a perturbation at 
one node's state. Rather than a fixed point, a par- 
ticular temporal sequence of states may be defined 
as the functionality of the system [551 IMl HI US] ■ 



6. 3. Measures of centrality 

Degree centralities. They measure importance in a 
linear fashion, assuming that a node with twice the 
number of links also is twice as important. Sev- 
eral different degree centralities can be defined by 
using a function of the in-degree dj" and the out- 
degree d™* of a node i for a given adjacency matrix 
A: 



2^ fly 



(5) 



Overlaps. We distinguish the in-overlap o'" be- 
tween nodes i and j as 

iiw.N |{fc : Qfc» = 1 Aflfcj = 1}| 
\{k : flfe^ = 1 V Okj 1}| 

and the out-overlap o°"* 

OUt/.N ^ |{fc : Oifc = 1 A Ojk = 1}| ,„^ 



6. Methods 



If the denominator is zero, the whole expression is 
defined to be zero. 



6.1. Finding functional fixed point 

After generating a network W, the functional 
fixed point s* is assigned as follows. An initial state 
vector s(0) € {0, 1}" is drawn uniformly. The dy- 
namics is run from s(0) by iterating Equation Q 
until a state is seen the second time and an attrac- 
tor is reached, i.e. at times ti > t2 > such that 
s(ti) — s{t2). If the attractor is a non-trivial fixed 
point, s{t2) = s{t2 — 1) 7^ (0, 0, . . . , 0), we take it 
as the functional fixed point s* := s{t2). Other- 
wise the network W is discarded and replaced by 
another random instance. 



Betweenness centrality. It quantifies the fraction of 
shortest paths that pass through this node j30j. 

bM) = E ^ , (8) 

where the sum runs over all ordered node pairs 
{j,k); fjj]. denotes the total number of shortest 
paths from node j to node fc; <Jjk{i) is the number 
of such paths running through node i. A modified 
Floyd- Warshall algorithm |3T1 allows to simultane- 
ously compute the lengths and numbers of shortest 
paths. 
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Replacement centrality. Let us define the replace- 
ment centrality of a pair of nodes as 



(9) 



if the denominator in Equation ([9| is non-zero and 
rij{A) = otherwise. 

Structural synthetic lethality. When only knowing 
the graph A, the best predictor of synthetic lethal- 
ity for nodes i and j is given by the fraction of net- 
works exhibiting synthetic lethality at i and j out 
of all networks compatible with A. This fraction is 
called structural synthetic lethality and is formally 
defined as 



_ \{W G U{A)\{iJ} synth. lethal in W}\ 

'^'^ ' ~ mA)\ 

(10) 

where J^{A) is the set of all networks that map to 
the graph A. 

6.4- Receiver Operating Characteristic (ROC) 

The sensitivity of a prediction is the fraction of 
cases for which the outcome is positive and correctly 
identified. Similarly, the specificity is the fraction 
of cases correctly identified as negative. A Receiver 
Operating Characteristic (ROC) [35] is the collec- 
tion of all tuples of (specificity, sensitivity) obtained 
by varying a threshold on the quantity used as a 
predictor. 

Formally, we consider a set of objects S with 
a binary partition, i.e. subsets S*"*" and with 
5+ n 5"" = and S+ U = S. As a predictor of 
this partition, we consider a function w : — >■ M and 
a threshold value i? S M. An object x G S is pre- 
dicted as positive (-I-) if v{x) > The sensitivity 
measures the fraction of objects from 5*+ predicted 
to be positive (-I-), i.e. 



\{x e 5*+ : v{x) > t?}| 
\S^\ 



(11) 



and analogously, the specificity is the fraction of 
objects from S~ predicted to be negative (— ), 



\{x e S- : v{x) < 



(12) 



For the prediction of lethality, S is the set of all 
single knock-outs in all network realizations con- 
sidered, 5*+ are the lethal knock-outs, 5~ the vi- 
able ones. Analogously in the context of double 



knock-outs, 5*+ are the synthetically lethal cases. 
The function v is the predictor used, such as out- 
degree, in-degree etc. ROC plots show sensitivity 
as a function of specificity subtracted from 1. 
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